Efficient parallel and incremental parsing of practical context-free languages

نویسندگان

  • Jean-Philippe Bernardy
  • Koen Claessen
چکیده

We present a divide-and-conquer algorithm for parsing context-free languages efficiently. Our algorithm is an instance of Valiant’s (1975), who reduced the problem of parsing to matrix multiplications. We show that, while the conquer step of Valiant’s is O(n), it improves to O(log n) under certain conditions satisfied by many useful inputs, and if one uses a sparse representation of matrices. The required conditions occur for example in program texts written by humans. The improvement happens because the multiplications involve an overwhelming majority of empty matrices. This result is relevant to modern computing: divide-and-conquer algorithms with a polylogarithmic conquer step can be parallelised relatively easily.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Incremental Parsing for Context-Free Languages

An incremental parsing algorithm based on dynamic programming techniques is described’. The analyzer takes the automaton generated from a general class of context-free grammars as driver, and any finite string as input. Given an input string that has been modified, the algorithm cuts out the parts of the old analysis that had been generated b y the parts of the input that has changed. What rema...

متن کامل

Operator Precedence Languages: Their Automata-Theoretic and Logic Characterization

Operator precedence languages were introduced half a century ago by Robert Floyd to support deterministic and efficient parsing of context-free languages. Recently, we renewed our interest in this class of languages thanks to a few distinguishing properties that make them attractive for exploiting various modern technologies. Precisely, their local parsability enables parallel and incremental p...

متن کامل

An E cient Context-Free Backbone for Natural Language Analyzers

Parsing eeciency is crucial when building practical natural language systems. This paper introduces an eecient context-free parsing algorithm based on dynamic programming techniques 1 , and focuses on its practical application to natural languages interfaces. Our analyzer takes a general class of context-free grammars as drivers, using the same user interface applied by the standard generators ...

متن کامل

Incremental CFG Parsing with Statistical Lexical Dependencies

Incremental parsing with a context free grammar produces partial syntactic structures for an initial fragment on the word-by-word basis. Owing to the syntactic ambiguity, however, too many structures are produced, and therefore its parsing speed becomes very slow. This paper describes a technique for efficient incremental parsing using lexical information. The probability concerning dependencie...

متن کامل

Incremental Parsing with Parallel Multiple Context-Free Grammars

Parallel Multiple Context-Free Grammar (PMCFG) is an extension of context-free grammar for which the recognition problem is still solvable in polynomial time. We describe a new parsing algorithm that has the advantage to be incremental and to support PMCFG directly rather than the weaker MCFG formalism. The algorithm is also top-down which allows it to be used for grammar based word prediction.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • J. Funct. Program.

دوره 25  شماره 

صفحات  -

تاریخ انتشار 2015